The Multi-Agent Reinforcement Learning In Malmo Competition